Information Retrieval with Language Knowledge
نویسندگان
چکیده
The introduction of Swedish made it possible for Lexware to be tested for the first time in CLEF. Lexware is a natural language system applied in an information retrieval task and not an information retrieval systems using NLP techniques, therefore it is interesting to compare its results with other less odd IR systems. We experience that separate evaluation of document description and query building would provide yet better testing for our system. 1 A Natural Language System for Swedish Lexware is a natural language system applied in an information retrieval task and not an information retrieval systems using NLP techniques, like e.g. NLIR [4]. It can be considered odd also among natural language processing systems, if the latter are assumed to focus on syntactic analysis (c.f. [3]). Text-analysis is shallow and it is not demanding in terms of computing power and storage [1]. The strength of the system is its rich lexicon and the possibility to expand the lexicon with external items without negative impact on access time [2]. The vocabulary of about 80 000 lexical items is richly interconnected by relations of form and content: derivational origin, synonymy, components for complex items, hyponymy. Content words are categorized into about 100 content categories. There are also supplementary word lists which include about 50 000 non-appellatives like names of people, places, organizations, etc. plus basic glossaries of English, French, German, and Latin. 400 word formation rules cope with inflection, compounding and derivation, 500 general phrase rules plus 700 collocation patterns are used to disambiguate and to determine modifier–head roles. 2 Lexware in Another Information Retrieval Task Lexware has been extensively tested in another information retrieval task. The library of the Swedish parliament – Riksdagsbiblioteket, designed and conducted evaluation of software that could supplement or even substitute manual indexing of the documents of the parliament. The task is to select proper keywords among descriptors in a thesaurus specially created for this kind of documents.
منابع مشابه
Studying the Effect of Retrieval Direction during Reading on Productive and Receptive Knowledge of Vocabulary
Retrieval tasks provide learners with an opportunity to focus both on meaning and on form. There are four different retrieval directions. The present study aimed to identify the optimal direction of recall type retrievals during reading and to investigate the outcomes of each one. Forty-eight intermediate EFL learners took part in the study. One of the experimental groups was provided with the ...
متن کاملComparative Study of Degree of Bilingualism in Lexical Retrieval and Language Learning Strategies
This study compares lexical retrieval amongst monolinguals and intermediate bilinguals and advanced bilinguals. It also investigates the possible effects of their language learning strategies on their respective lexical retrieval advantage. The study used a mixed methods design and the groups consisted of 20 Persian near-monolinguals, 20 Persian-English intermediate level bilinguals, and 20 Per...
متن کاملبررسی تأثیرات ریشهیابی در بازیابی اطلاعات در زبان فارسی
Using the language-specific behavior in information retrieval systems can improve the quality of the retrieved results significantly. Part of the word that remains after removing its affixes is called stem. Stemming process can be used for improving the relevancy of the results in information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...
متن کاملImpact of Controlled and Free Language Use in Retrieving Articles from the ProQuest and Science Direct Databases
Abstract Introduction: The growth and expansion of the Internet has changed the way information is accessed and many facilities have been created on the Web to facilitate and expedite information locating. Objective: To identify the impact of keyword documentation using the medical thesaurus on the retrieval of articles from Proquest and Science Direct databases. Materials and Methods:The pr...
متن کاملPerformance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature
Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...
متن کاملدیداری کردن نتایج جستوجو در فرایند بازیابی اطلاعات
Purpose: One of the most effective ways to achieve optimum information retrieval is through visualization of Information. Search strategies, probing skills, querying of information needs and analysis of information play a significant role in the accessing of necessary and useful information. Besides the factors mentioned above, information visualization can increase the availability level of in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002